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RESOLUTION  GRAPHS 


Abstract 


by 

Robert  A,  Yates 
Bertram  Raphael 
Timothy  P.  Hart 

This  paper  introduces  a  new  notation,  called  "resolution  graphs," 
for  deductions  by  resolution  in  first-order  predicate  calculus.  A 
resolution  graph  consists  of  groups  of  nodes  that  represent  initial 
clauses  of  a  deduction  and  links  that  represent  unifying  substitutions. 
Each  such  graph  uniquely  represents  a  resultant  clause  that  can  be 
deduced  by  certain  alternative  but  equivalent  sequences  of  resolution 
and  factoring  operations. 

Resolution  graphs  are  used  to  illustrate  the  significance  of  merges 
and  tautologies  in  proofs  Ig?  resolution.  Finally,  they  provide  a  basis 
for  proving  the  completeness  of  a  proof  strategy  that  combines  the  set 
of  support,  resolution  with  merging,  linear  format,  and  Loveland’s  sub¬ 
sumption  conditions. 


li 


I 


INTRODUCTION 


Automatic  theorem  proving  in  the  first-order  predicate  calculus 

has  become  an  active  and  fruitful  field  for  research,  particularly  since 

* 

Robinson's  landmark  paper  (1965)  introduced  the  resolution  rule  of 
inference.  Computer  programs  that  perform  logical  deduction  by  using 

some  variation  of  resolution  have  been  applied  to  a  variety  of  problem 
domains,  including  lattice  theory  (Guard,  et  al,,  1969),  question'-answerii^ 

systems  (Green  and  Raphael,  1968),  and  problem-solving  tasks  (Green,  1969). 

The  major  reason  these  programs  have  had  only  limited  effectiveness 
is  that  they  require  excessive  amounts  of  computer  time  and  space.  This 
is  largely  due  to  the  weakness  of  existing  strategies  for  deciding  how 
to  apply  the  resolution  rule.  Robinson  presented  one  logically  complete 
(but  grossly  inefficient)  strategy  for  using  resolution  to  prove  theorems. 
Since  then  numerous  papers  have  appeared  describing  more  restrictive 
strategies  that  are  also  complete.  The  nature  of  some  of  these  strategies 
is  syntactic,  i.e.,  they  depend  upon  the  identity  or  number  of  elementary 
symbols  (Wos  et  al.,  1964;  Wos  et  al.,  1965;  Andrews,  1968;  Loveland,  1970), 
Other  strategies  are  of  a  semantic  nature,  i.e.  they  depend  upon  assign¬ 
ments  of  models  or  truth  values  (Slagle,  1967;  Luckham,  1968) . 

In  this  paper  we  prove  the  completeness  of  an  extremely  restrictive 
s5Titactic  strategy.  Raphael  (1969)  presented  part  of  this  strategy  in 

References  appear  at  the  end  of  this  paper. 
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an  informal  note  a  year  ago.  (This  paper  supercedes  that  note.)  Since 
then,  Its  completeness  has  been  independently  established  by  Anderson  and  - 
Bledsoe  (1970)  . 

The  completeness  of  this  strategy  was  discovered  by  us  while  working 
with  a  new  graphical  notation  for  resolution  deductions.  This  notation^ 
whose  justification  is  based  upon  the  mathematical  concept  of  a  partition, 
leads  us  to  a  clearer  understanding  and  simplified  proofs  for  several 
existing  theorems  in  resolution  theory.  Therefore  a  major  purpose  of 
this  paper  is  to  present  the  idea  of  a  resolution  graph  and  show  its 
usefulness . 

II  TERMINOLOGY 

We  shall  assume  familiarity  with  standard  terminology  and  notation 
of  first-order  predicate  calculus  and  proof  by  resolution.  In  summary^ 
all  logical  statements  are  assumed  to  be  in  quantifier-free  conjunctive- 
normal  formj  existential  quantifiers  are  eliminated  by  the  introduction 
of  Skolem  functions^  and  universal  quantification  is  assumed  over  varia¬ 
bles.  The  initial  information  is  represented  by  a  finite  set  of  clauses, 
each  of  which  is  a  set  of  literals .  (This  finite  set  is  called  the 
"clause  fonn"  of  the  predicate -calculus  statement  obtained  by  forming 
the  disjunction  of  the  literals  in  each  clause^  and  then  the  conjunction 
of  the  resulting  formulas) .  Each  literal  is  either  an  atomic  formula 
or  the  negation  of  an  atomic  formula.  An  atomic  formula  consists  of  a 
predicate  symbol  and  an  appropriate  number  of  terms  for  its  argiunents. 
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Each  term  is  either  a  constant^  a  variable^  or  the  composition  of  a 
function  applied  to  an  appropriate  number  of  terms  as  arguments. 

Resolution  may  be  taken  to  be  an  operation  mapping  two  parent  clauses 
B  and  C  into  a  "resolvent”  clause  D.  By  Robinson’s  definition^  the  clause 
D  is  a  resolvent  of  B  and  C  (which  have  been  "standardized”  to  have  no 
variable  names  in  common)  iff  (if  and  only  if)  there  are  nonempty  subsets 

b  c  B  and  c  c  C  such  that  the  atomic  formulas  in  b  U  c  are  unifiable  with 
a  most  general  unifier  a, b  a  =  {L  },  c  a  =  {Lo};  where  L  and  L  are  com- 
plementary  literals,  and  D  =  (B  ct  -  {h  })  U  (C  a  -  {L  }).  We  call  elements 
b  U  c  the  literals  resolved  upon.  (The  concepts  of  unifier  and  most  gen¬ 
eral  unifier  will  be  discussed  in  detail  in  the  next  section  of  the  paper.) 

Andrews  ( 1968)  broadens  this  definition  of  resolvent  by  not  requiring 
a  to  be  most  general.  This  is  equivalent  to  considering  every  substitution 
Instance  of  the  above  D  also  to  be  a  resolvent  of  B  and  C, 

A  common  restriction  of  resolution  is  simple  resolution,  in  which 
b  and  c  must  be  singletons.  It  can  be  shown  that  an  inference  system 
based  upon  simple  resolution  alone  is  not  complete. 

If  some  clause  D  has  a  subset  d  and  a  is  a  most-general  unifier  of 
the  literals  in  d^  do  =  |l|,  then  we  call  the  clause  Do  a  factor  of 
and  D  a  parent  of  its  factor.  Since  a  clause  implies  all  of  its  instances, 
clearly  a  clause  implies  all  of  its  factors.  One  can  show  that  simple 
resolution  and  factoring  can  form  a  complete  inference  system.  [Although 
factoring  played  a  major  role  in  an  early  unpublished  version  of  Robinson  (1965) 
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and  was  Implemented  in  the  first  resolution  program  (Wos  et  al.  (1964, 


1965)),  it  has  been  little  discussed  in  the  literature,  A  variation 
called  distinguished  literal  factoring  is  discussed  by  Kowalski  and  Hayes 
(1969)],  In  fact,  resolution  in  Robinson's  sense  may  be  viewed  as  a 
simple  resolution  that  has  been  preceded,  if  necessary,  by  appropriate 
factoring  operations  on  the  parents.  In  this  paper  we  shall  generally 
view  a  resolution  inference  step  as  consisting  of  the  two  phases:  factoring 
followed  by  simple  resolution. 

A  deduction  of  clause  C  from  an  initial  set  of  clauses  S  is  a  finite 

sequence  of  clauses  B, ,  B„,  B  such  that: 

12  n 

(1)  B.  1  ^  i  ^  n,  is  either  in  S  or  it  is  a  resolvent  of  B. 

1  J 

and  B  ,  1  ^  j,  k  <  i.  and 

K 

(2)  B  =  C. 

n 

A  refutation  of  a  set  S  of  clauses  is  a  deduction  from  S  of  the  empty 
clause,  which  is  denoted  by  j]]  ■  The  usual  way  to  attempt  to  verify  that 
a  theorem  T  is  deducible  from  a  set  of  axiomatic  clauses  G  is  to  attempt 
to  construct  a  refutation  of  the  set  p  U  where' O'  is  the  clause  form 
of  the  negation  of  the  theorem  T. 

If  D  is  either  a  resolvent  or  a  factor^  each  literal  L  e  is  equal 
to  h'a  for  at  least  one  literal  in  a  parent  clause  (and  the  appropriate 
substitution  a) .  Every  such  l'  is  called  a  parent  of  literal  L.  Thus 
in  any  deduction  of  a  clause  D  from  a  set  S  of  initial  clauses^  we  may 
trace  the  ancestry  of  the  literals  in  D  back  to  literals  in  the  members 
of  S. 

Andrews  (1968)  defines  a  merge  to  be  any  deduced  clause  containing 
a  literal  that  has  parent  literals  in  both  parent  clauses.  We  find  it 


4 


convenient  to  extend  this  concept  by  defining  a  demerge  ("descent  merge") 
to  be  any  clause  containing  any  literal  L  that  has  ancestors  in  two  or  more 
distinct  occurrences  of  members  of  S,  and  we  call  such  an  L  a  d-merge  literal . 
Every  d-merge  is  thus  either  a  merge  or  the  descendant  of  a  merge.  Since  the 
formation  of  a  merge  always  causes  two  or  more  literals  in  different  initial 
clauses  to  become  associated  (by  virtue  of  becoming  ancestors  of  the  same 
literal),  we  shall  sometimes  use  the  term  "merge"  merely  to  refer  to  this 
association,  which  is  exhibited  as  an  explicit  link  in  the  "resolution  graphs" 
to  be  defined  below.  Note  that  L  may  have  two  or  more  ancestors  and  yet  not 
be  a  d-merge  literal  (when  all  its  ancestors  occur  in  the  same  occurrence  of 
a  member  of  S) . 

Subsumption  is  an  important  phenomenon  in  most  resolution  proof 
strategies.  For  any  two  clauses  C  and  D,  C  is  said  to  subsume  D  if 
an  instance  of  C  is  contained  in  D,  i.e., there  exists  a  substitution  ct 
such  that  Ccr  C  D.  If  D  is  subsumed  by  C,  it  is  implied  by  C  and  generally 
may  be  replaced  by  C  in  the  construction  of  a  deduction.  Note  that  a 
tautology  (a  clause  containing  a  pair  of  complementary  literals)  cannot 
subsume  any  nontautological  clause,  since  substitution  cannot  destroy 
the  tautologousness . 

Ill  PARTITIONS  AND  UNIFICATION 

We  shall  now  explore  some  of  the  properties  of  the  unification 
operation.  The  goal  of  this  discussion  is  to  clarify  the  possible 
effects  of  composing  or  permuting  substitutions. 

Unification  Theorem 

Let  E  =  ®n|  ^  expressions  (e.g.^  all 

the  atomic  formulas  that  occur  in  some  set  S  of  initial  clauses)  .  Let 
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£  -  {e^,  Eg,  E^}  be  a  class  of  nonempty  subsets  of  E  (e.g.,  think 

of  each  E^  as  containing  a  different  set  of  atomic  formulas  from  E  that 
might  be  made  identical  by  an  appropriate  substitution) .  Thus  E^  c  e 
and  <f)  (the  empty  set)  for  all  i.  If  0  is  any  substitution,  then 

£0  =  Ie  0,  E„0,  E  0I,  where  E  0  is  the  set  of  expressions 

{1^2  ’my  i 

e  0  obtained  by  making  the  substitution  defined  by  0  in  each  e  in  E  . 
j  j  i 

A  class  £  is  said  to  be  unif iable  if  there  exists  a  substitution 
0  such  that  E  0  is  a  singleton,  i.e.,  contains  only  one  element,  for 
every  i;  and  such  a  0  is  said  to  unify  £• 

The  well-known  unification  theorem  can  be  stated  as  follows: 
Unification  Theorem — ^Let  £  be  a  unifiable  class  of  subsets  of  a  set  E  of 


expressions.  Then  there  exists  a  most  general  unifier  a  of  £  with  the 

£ 

property  that  for  any  unifier  0  of  £,  there  is  a  substitution  X  such 
that  0  is  the  composition  cr„  o  X  of  substitutions  o_  followed  by  X,  Thus 
every  unifier  of  £  is  an  instance  of  the  most  general  unifier  of  £.  (A 
proof  and  discussion  of  the  Unification  Theorem  appears  in  Robinson  (1967).) 

Clearly,  "the"  most  general  unifier  of  a  class  is  not  unique; 
any  most  general  unifier  of  £,  when  composed  with  any  invertible  sub¬ 


stitution,  is  again  a  most  general  unifier.  For  example,  the  class 


has  both  0  =  [g(y)/x,  y/ul  and  0_  =  [g(u)/x,  u/y"]  as  most  general 

unifiers .  However,  0  can  be  obtained  from  0  by  the  invertible  sub- 

X 

stitution  [y/u,  u/y].  In  fact,  it  is  a  corollary  of  the  Unification 

Theorem  that  this  is  the  only  way  most  general  unifiers  of  a  set  can 

be  related:  If  a  and  a'  are  most  general  unifiers  of  they  are 
o  o 

instances  of  one  another  and  consequently  are  alphabetic  variants  of 


each  other.  Since  all  such  a  are  essentially  equivalent,  henceforth 

c 

we  shall  assume  that  a  is  unique. 

o 


Partitions 

Let  £  =  Ie  ,  E  ,  ...  E  i  and  3  =  !f  ,  F  ,  F  j  be  two  classes 

(12  m)  I  1  2  n) 

of  nonempty  subsets  of  a  set  E  of  expressions  (e.g.,  if  E  contains  all 
the  atomic  formulas  in  some  set  S  of  initial  clauses,  let  the  E.  be  sets 
of  potentially  imifiable  atomic  formulas  from  some  subset  of  S,  and  let 
the  F.  be  the  corresponding  sets  for  a  different,  perhaps  larger,  sub¬ 
set  of  S.) 

We  define  the  following  partial  ordering  on  the  classes  of 

subsets  of  E;  £  s  3  iff,  for  each  i,  E  C  F  for  some  j. 

"  i  “  J 

The  class  £  is  called  a  partition  provided  E  ,  ...,  E  are  mutually 

-  1  m 

disjoint  sets.  By  the  closure  of  £,  denoted  by  [£],  we  mean  the  small¬ 
est  partition  such  that  £  £  [£] .  The  class  [£]  is  formed  from  £  by 

successively  merging  together  sets  E  E  .  of  £  with  an  element  in  common 

i  3 

until  all  the  sets  so  obtained  are  mutually  disjoint. 
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Example :  If  e  ^  jg(x^y),  b,  h(u)| 

{f(x),  h(u),  c|,  jg(f(x),y),  c^l 
then  [&]  =|{aj,  jx,  f  ( y) ,  uj,  jg(x,y),  b,  c,  f(x),  h(u). 


g(f(x) 


Induced  Partitions 


If  E  is  a  set  of  expressions  and  6  a  substitution,  then  0  induces 

a  partition  P^  on  E  defined  as  follows:  Two  expressions  e.  and  e.  in 

E  lie  in  the  same  block  of  P„  iff  e  0  =  e  0,  For  example, 

0  1  j 


if  E  =  f{x),  u,  g(x,y) ,  f(h(v)) 


and  0  =  [h(v)/x,  g(ii(v)  ,y)/u] ,  then  0  induces  the  partition 


Pg  =  I  jf(x),  f(h(v))j  ,  ju,  g(x,  y)  I  I  on  E. 


If  £  is  a  unifiable  class  of  subsets  of  E,  and  0  unifies  6,  then 


the  partition  P  induced  by  0  on  the  expressions  occurring  in  £.  has  the 
0 

property  that  £  S  P  because  if  E  e  £  then  all  the  expressions  in  E 

0  i  i 

are  unified  by  0  and  hence  E  is  contained  in  a  single  block  of  P  . 

i  6 

Since  the  closure  t£]  of  £  is  the  smallest  partition  containing  £,  we 

have  £  ^  [£]  S  P  .  Conversely,  a  class  £  is  unifiable  if  £  S  P  (or 
9  9 

[£]  £  P^  for  some  substitution  0. 
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Now,  if  a„  is  a  most  general  unifier  of  £.,  then  a  =  ^rpi* 
ty  ^  LO  J 

is  true  because  if  two  blocks  of  g.  contain  an  element  e  in  common,  any 
unifier  0  of  6-  must  unify  both  blocks  to  the  same  element  e0.  Therefore, 
the  unifier  of  a  class  of  sets  of  expressions  depends  only  upon  the 
partition  [£•],  the  closure  of  the  class.  It  also  follows  that  if  g  and 
S'  are  classes  such  that  S'  ^  £  and  £  is  unifiable,  then,  since  [£  U  S']  = 
[£],  we  have: 


^£  U  SF  " 


Equivalent  Substitutions 

The  following  lemma  shows  an  important  invariance  of  unification 
to  the  order  in  which  substitutions  are  performed: 

Lemma  1 :  If  £  and  S'  are  unifiable  classes  with  most  general  unifiers 


a  and  a  respectively,  then  £  U  S'  is  unifiable  if  and  only  if  . 

£  2F  ^^^£^ 

exists  (i.e,,  S'a  is  unifiable).  In  this  case  a,_  .  also  exists,  and 

C-  vCSO’^/ 

®  ,  and  a  o  .  are  both  most  general  unifiers  of  £  11  S'. 

£  (J5a^)  S?  (£ag.) 
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Proof:  We  first  show  that  If  £  U  3  is  unifiable,  then  ^  exists 

and  that  a„  „  =  cr„  o  a,  ^  .  Let  0  be  any  unifier  of  £  (J  U.  Since  0 

£U3  £ 

unifies  we  can  write  9  ~  o  K  for  some  substitution  K.  But  for 

o 

each  F.  eS,  F.0  z=  (F.o-)^  is  a  singleton^  so  \  unifies  Uo  .  Consequently, 

1  O 


£ 


a  _  exists,  and  X  =  a,„  .  o  |-i  for  some  n;  so  we  have  0  =  O-  o  o,^  o  [x. 


£ 

Since  o„  °  a,  ^  . 

£  (3^0^) 


'£'  ■  '“  £ 
does  in  fact  unify  £  U  3f  and  since  any  unifier  0  of 


£  U  S'  can  be  factored  as  0  =  a„  o  a,„  x  ®  ^  for  some  [i,  then  Oo  o  cr^_  . 

is  in  fact  a  most  general  unifier  of  £  IJ  S',  Conversely,  if  we  assume 

that  X  does  exist,  then  o  a, _  .  unifies  £  U  3f,  and  the  above 

(S'O^)  £  vS'a-)  ^ 

£  £ 

argument  shows  that  it  is  its  most  general  unifier.  The  result  is 
clearly  symmetric  in  £  and  3". 

An  immediate  corollary  is  that  if  ^  ^  £,  then 


^3?  °  "^(£0^) 
S' 


(sines  Sg  = 


Tills  says  that  the  most  general  unifier  o  of  £  can  be  computed  by  first 
unifying  a  subclass  ^  of  £  and  then  unifying  the  remainder  of  £  with 


the  unified  S',  By  induction  we  have  a  second  corollary: 

Corollary :  Let  £  =  .  ,E^]  be  a  class  of  sets  of  expressions. 

<^2  =  ^1  °  ^n  =  ^n-1  “  ' 


Then  exists  if  and  only  if  a, ,  a  all  exist,  and  ct_  =  a  • 

£  ±  n  £  n 


Proof:  For  n  =  1  there  is  nothing  to  prove. 

For  n  >  1  let  S'  =  By  the  induction  hypothesis, 


10 


cr„  exists  iff  a,  ..,cr  ,  exist  and  ct_  =  cr  So  ct„  =  a-  o 

3f  1  n-1  5  n-1  &  5 

cr|-g  and  exists  iff  exists  i.e.,  iff 

n  3^  n  3^ 

all  exist . 

Since  a  is  independent  of  the  sequence  numbering  of  E,,...,E  ,  it  is 
G  In 

clearly  independent  of  the  sequence  of  operations  used  to  compute  it. 


Example :  Let  &  =  ||x,'  f(u,  g(v)),  f(a,y)|  ,  -^gCb),  y^j 

S  g(v)),  f(a,y)||  so  3  ^  G 

then  =  [a/u,  g(v)/y],  Ga^.  =  ||x,  f(a,g(v))|  ,  |g(b),  g(v)| 

‘^(Ga  )  ~  [f (a,g(b))/x,  b/v] 

=  a^.  o  )  =  ta/u,  g(b)/y,  f(a,g(b))/x,  b/v3 

3? 


IV  RESOLUTION  GRAPHS 

One  of  the  main  results  of  this  paper  is  the  fact,  to  be  stated 
formally  in  Theorem  1,  that  certain  alternative  sequences  of  simple 
resolution  and  factoring  operations  produce  precisely  the  same  resul¬ 
tant  clause  .  The  purpose  of  the  graphical  notation  developed  in  this 
section  is  to  represent  uniquely  certain  equivalent  sets  of  deductions. 

Let  S  be  a  set  of  initial  clauses,  and  let  C  =  [l, ,  L_,  L  1 

12  n 

be  a  clause  in  S  where  L, ,  L  are  the  literals  of  C.  We  represent 

1  n 

C  as  a  graph  by  associating  a  circle  to  each  literal  of  C  and  by 

connecting  the? circles  by  horizontal  bars ? (Figure  la).  Such; a* graph  is 
called  an  initial  graph  since  it  represents  an  initial  clause.  The 
circles  of  the  graph  are  named  by  their  corresponding  literals  L^,  . . . ,  L^ 


and  can  occur  in  any  order;  two  graphs  that  differ  only  because  the 
literals  within  a  clause  are  differently  sequenced  will  be  considered 
equivalent.  These  initial  graphs  are  the  building  blocks;  more  compli¬ 
cated  graphs  are  always  constructed  from  these  by  operations  correspond¬ 
ing  to  resolution  and  factoring.  Before  stating  a  formal  definition  of 
the  graph  structures  we  call  resoluti on  graphs,  we  first  give  some  in¬ 
stances  of  how  these  graphs  are  formed  and  introduce  some  additional 
terminology. 


If  D  =  {m, ,  M  1  is  another  initial  clause  which  is  resolved 

1 

against  C  on  the  literals  and  ,  we  represent  the  resolvent  clause 

R(C,D)  as  a  graph  by  connecting  the  two  corresponding  circles  (in  the 

initial  graphs  for  C  and  D)  by  a  double  bar  (Figure  lb) .  The  literals 

of  the  clause  R(C,D)  are  the  names  of  the  circles  in  the  new  graph  which  are  not 


connected  by  double  bars.  These  literals  (marked  by  primes  in  Figure  lb) 
are  assumed  to  be  those  of  the  initial  clauses  Instantiated  by  the  most 

general  unifier  of  the  atomic  formulas  of  L  and  M.  (which  reduced  L.  and  M. 

■ '  . i' . . . . J~‘ . . . .  1  ‘  3 

to  the  complementary  literals  and  mO .  These  instantiated  literals 
are  said  to  be  associated  to  their  corresponding  circles.  However,  we 
shall  continue  to  refer  to  the  circle  by  its  original  literal  name. 

Should  two  or  more  literals  collapse  together,  i.e.,  become  identical, 
as  the  result  of  a  resolution  operation,  we  indicate  this  in  the  graph 
by  connecting  the  literals  (circles)  together  by  a  dotted  line — thereafter 
forcing  these  circles  to  be  considered  as  a  single  node  (see  Figure  Ic) . 

(We  define  a  node  of  a  graph  to  be  a  dot -connected  group  of  circles — or 
a  single  circle  which  is  not  dot-connected.)  For  example,  if  the  most 
general  unifier  of  and  M.  also  happens  to  make  and  into  identical 
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literals,  we  represent  the  resolvent  clause  by  the  graph  shown  in  Figure 

Ic.  However,  we  prefer  to  consider  the  graph  operations  of  adding  double 

bars  and  dotted  lines  as  independent  operations.  Simple  resolution  applied 

to  graphs  is  accomplished  by  a  double  bar  operations  followed  by  dotted  line 

operations  if  the  literals  should  collapse.  In  general,  we  allow  graphs  in 

which  two  distinct  circles  have  the  same  associated  literal  without 

demanding  that  the  circles  be  dot-connected . 

The  dotted  line  is  also  used  to  indicate  an  explicit  factoring 

operation.  If  and  in  Figure  lb  are  distinct  but  unifiable,  then 

Figure  Ic  represents  the  result  of  applying  the  factoring  substitution 

to  the  resolvent  clause.  The  associated  literals  L/ ,  ....  M'  would  be 

1  1 

replaced  by  their  respective  instances. 

We  say  a  graph  node  is  free  if  it  is  not  connected  to  any  other 
node  by  a  double  bar,  and  a  circle  is  free  if  the  node  to  which  it 
belongs  is  free.  Graph  operations  are  only  performed  on  free  nodes. 

As  an  example,  let 

h  ^2  S 


=  [P(x),  Q(a),  R(a.x)} 

L,  L 

4  5 

C  =  Q(y) ,  R(y,b) } 


Cg  =  [~P(b)} 


R(a,z),  P(f(z))}  .  . 


(See  Figure  2a),  Let  R  be  the  resolvent  of  C  and  C  : 

J.  J.  ^ 


Let  R  be  the  resolvent  of  R  and  C  : 

^  X  o 

R^  =  [R(a,b)}  , 

and  let ’R  be  the  resolvent  of  R  and  C  : 

R  =  [P(f(b))} 

(Figure  2bi) 

We  have  labeled  the  graph  operations  o',  3.7,  6  as  seen  in  Figure  2b 
where  y  is  the  operation  of  adding  the  dotted  line.  So  the  sequence  of 
graph  operations  leading  to  R^  is  However,  we  could  obtain 

the  same  result  by  performing  the  operations  in  any  one  of  the  following 
orders; 

(2)  <0,3,7, 6) 

(3)  <6.7»Q'»6) 

<4)  <3, 7, 6,0) 

For  example,  the  sequence  <6, 7, 6, O')  corresponds  to  first  resolving  and 

C  then  factoring  L  and  L  then  resolving  with  C,  and  finally  with  C  . 

00  4  3 

It  is  a  consequence  of  the  corollary  to  lemma  1  that  any  two  sequences 
of  graph  operations  which  lead  to  the  same  final  graph  structure  produce 
the  same  resulting  clause. 

In  Figure  2,  the  literals  in  parentheses  are  the  associated  literals; 
each  associated  literal  is  associated  to  a  graph  node  and  is  an  instance 
of  one  of  the  original  literals  naming  a  circle  of  the  node.  Since  we 
shall  show  that  these  literals  do  not  depend  on  the  order  of  graph  opera¬ 
tions,  we  shall  subsequently  drop  these  literals  from  the  diagrams. 
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"merge  In  a  graph'^Avill  always  refer  to  a  dotted  link  between 
literals  in  distinct  initial  clauses.  The  resultant  clause  of  a  graph 
containing  such  a  merge  link  is  itself  either  a  merge  of  a  d-merge,  but 
we  shall  not  always  be  able  to  tell  which  (and  the  distinction  will  be 
unimportant  in  our  development) , 

Graph  Structure 

In  the  preceding  discussion  we  described  the  resolution  graphs 
corresponding  to  certain  deductions.  We  now  give  a  recursive  defi¬ 
nition  for  the  class  of  structures  called  resolution  graphs  or  simply 
graphs,  in  the  remainder  of  the  paper.  (Note;  we  assume  in  the  follow¬ 
ing  that  every  occurrence  of  an  initial  clause  contains  a  unique  set  of 
variable  symbols,  so  that  no  naming  conflicts  arise  during  the  process 
of  forming  graphs.) 

Definition  of  Graph 

(1)  The  representation  for  any  initial  clause  by  a  connected  row 
of  circles  is  a  graph,  and  is  called  an  initial  graph. 

(2)  If  G  is  a  graph,  and  two  associated  literals  from  distinct 
free  nodes  may  be  made  identical  by  the  most 

general  unifier  the  result  of  connecting  together  all  the 
circles  in  the  two  nodes  by  dotted  lines  is  a  graph.  The  new 
connected  group  is  a  single  free  node  of  the  new  graph  and 
each  associated  literal  is  replaced  by  its  ^-Instance. 

(3)  If  G  and  G  are  graphs  and  the  associated  literals  of  a  free 

1  ^ 

node  N  of  G  and  a  free  node  N_  of  G  may  be  unified,  the 
XX  ^  ^ 

result  of  dot  connecting  the  circles  of  N  and  N  is  a 

X 

graph  G.  Again,  the  new  dot-connected  group  is  a  single  node 
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of  G  and  the  associated  literals  are  replaced  by  their  respec¬ 
tive  instances  (under  the  unifying  substitution) . 

(4)  If  G  and  G  are  graphs,  and  if  the  associated  literals  of 
1  2 

a  free  node  of  G^  and  of  a  free  node  of  G^  may  be 

made  complementary  (i.e.,  identical  atomic  formulas  and 

opposite  signs)  by  an  appropriate  unifier  then  the  result 

of  connecting  N  and  N  by  a  double  bar  is  a  graph  G.  N  and 
1  ^ 

N  are  no  longer  free  in  G  and  the  associated  literals  of  the 
2 

nodes  in  G  are  X-instances  of  the  associated  literals  of  G^  and  G, 

(5)  Only  hhose  structures  which  can  be  built  up  by  a  finite  number 
of  applications  of  rules  1,  2,  3  and  4  to  some  set  of  initial 
clauses  are  graphs. 

Examples:  Figure  3  shows  some  possible  and  impossible  graph  struc¬ 

tures  . 

Definition:  A  set  of  literals  (or  circles)  in  a  graph  is  said  to 
be  unifier-connected  if  the  literals  are  connected  by  double  bars  or 
dotted  lines  or  both;  e.g.,  the  graph  in  Figure  4a  contains  four  unifier- 
connected  sets: 


Definition:  Let  F  be  the  class  of  unifier-connected  sets  of  liter- 
• - - -  G 

als  in  a  graph  G,  except  that  all  negation  signs  are  omitted,  so  that  £ 

G 

is  a  class  of  sets  of  atomic  formulas.  The  most  general  unifier  of  the 


graph,  denoted  by  a  ,  is  defined  to  be  the  most  general  unifier  of  £ 

G  G 
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In  tlie  graph  construction,  the  sets  in  are  successively  unified; 
hence,  by  the  corollary  to  lemma  1,  always  exists  (assuming  G  exists) 
and  the  associated  literals  of  the  nodes  of  G  are  given  by  applying 
to  the  original  literals  of  G.  This  argument  proves  that  a  resolution 
graph  is  independent  of  the  sequence  of  operations  by  which  it  is  con¬ 


structed.  The  graph  partition,  denoted  by  P  ,  is  the  partition  induced 

by  a  on  the  set  of  atomic  formulas  occurring  in  G.  Since  a„  may  unify 
G 

atomic  formulas  which  are  not  in  we  always  have; 

u 

(1)  ^  ^  and 


(2) 


Repetitions 

In  resolution  theorem  proving,  one  generally  considers  the  set  S 
of  initial  clauses  to  be  of  fixed  size,  even  though  a  single  member  of 
S  may  be  used  several  times  in  a  proof  and  each  such  use  requires  a 
new  alphabetic  variant  of  the  clause.  In  working  with  graphs,  we  find 
it  more  convenient  to  consider  each  occurrence  of  an  initial  clause, 
represented  by  a  row  of  circles,  to  be  distinct.  Therefore,  the  "set 
of  initial  clauses"  of  a  graph — or  the  "set  of  initial  graphs"  may  con¬ 
tain  repetitions  of  members  of  the  "set  of  initial  clauses"  in  the  pred¬ 
icate  calculus  sense.  Similarly,  when  discussing  the  "literals"  of  a 
graph  we  consider  each  node  in  the  structure  to  be  associated  with  a 
distinct  literal  (its  associated  literal)  even  though  the  literals  rep¬ 
resented  by  two  or  more  of  them  may  be  syntactically  identical.  With 
these  ideas  in  mind,  we  find  the  concepts  of  g-clause  ("graph-clause") 
and  subgraph  useful  . 
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Definition:  A  g-clause  is  an  unordered  tuple  of  literals.  Two 
g-clauses  are  equal  if  every  member  of  one  is  a  member  of  the  other,  and 
occurs  with  the  same  multiplicity. 

Definition:  The  resultant  of  a  graph  G  is  the  g-clause  of  associated 
literals  of  the  free  nodes  /of  G.  The  resultant  may  be  constructed  by 
first  forming  a  g-clause  of  initial  literals,  one  from  each  free  node  of  G, 
and  then  instantiating  each  member  by  a  .  Each  g-clause  detennines  a 
unique  set  of  literals  (deleting  multiple  occurrences)  and  hence  the 
resultant  determines  a  unique  clause  called  the  resolvent  of  the  graph/ 

Definition:  A  graph  G^  is  said  to  be  a  subgraph  of  another  graph 
G  provided  G  is  constructable  from  G'  by  applications  of  rules  1,  2,  3 
and  4  defining  graph  constructions.  We  also  call  G  an  extension  of  G'. 

We  note  that  the  relation  "G  is  a  subgraph  of  G  "  is  clearly 

X  ^ 

transitive,  and  that  every  initial  graph  in  a  graph  G  is  a  subgraph  of  G. 
Two  subgraphs  G,  and  G„  of  a  graph  G  are  said  to  be  disjoint  if  they 

JL  ^  ■■  ■■ 

have  no  Initial  graphs  in  common.  If  G  and  G  are  disjoint  they  can 

X  ^ 

have  at  most  one  double  bar  link  connecting  them.  (There  may  be  one  or 
more  dotted  links  between  them,  however).  Figure  4(b)  shows  the  12 
possible  subgraphs  of  the  graph  in  Figure  4(a)  .  (Note  that  certain 
graphs  may  in  a  sense  be  contained  within  a  graph  and  yet  not  be  sub¬ 
graphs  of  it  as  defined  here.  For  example,  the  graph  formed  by  dot¬ 
connecting  L  and  L„  of  subgraphs  (1)  and  (2)  of  Fig.  4b  cannot  be 
Z  o 

constructively  extended  to  the  full  graph  of  Fig.  4a — or  even  to  sub¬ 
graph  (6)  .) 
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Another  interesting  property  of  subgraphs  is  that  they  can  al^vays 
be  replaced  by  their  resultants.  Thus,  if  is  a  subgraph  of  G  and  if 
is  the  resultant  of  G^,  we  can  construct  an  initial  graph  for  and 
replace  G^  by  this  new  graph  in  G  without  affecting  the  resultant  of  G. 
This  is  possible  since  each  graph  operation  used  in  constructing  G  from 
G^  affects  only  the  free  nodes  of  G^,  and  their  associated  literals. 
However,  each  such  free  node  and  corresponding  literal  is  represented 
imiquely  in  the  graph  of  C^. 

Deductions 

As  mentioned  earlier,  the  introduction  of  a  dotted  line  within  a 
graph  corresponds  to  a  factoring  operation,  and  the  introduction  of  a 
double  bar  between  two  graphs  corresponds  to  simple  resolution.  Because 
of  the  distinction  between  g-clauses  and  ordinary  clauses,  and  because 
the  double  bar  operation  does  not  account  for  literal  collapses,  these 
correspondences  are  not  complete.  However,  we  can  define  simple  resolu¬ 
tion  and  factoring  in  terms  of  the  graph  operations  and  consequently 
obtain  a  precise  correspondence  between  deductions  and  resolution  graphs 
(or  an  appropriate  subset  thereof) . 

Definition:  Let  B,  .  B  be  a  deduction  of  a  clause  C  (B  =  C) 

from  a  set  S  of  initial  clauses.  We  define  the  graph  generated  by  the 
deduction  as  follows: 

(1)  Each  occurrence. of  an  initial  clause  of  S  in  the  deduction  is 
representedsbyisa  separate  initial  graph. 

(2)  For  each  factoring  step,  we  apply  the  factoring  substitution 
to  the  free  associated  literals  and  dot -connect  those  nodes  whose 
associated  literals  have  been  unified  by  the  substitution  (by  step  2  in 

the  definition  of  a  graph). 
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(3)  For  each  simple  resolution,  the  appropriate  double  bar  is 
added  to  the  graph  (by  step  4) .  Again,  if  any  associated  literals  from 
distinct  free  nodes  have  been  unified,  we  dot-connect  these  nodes. 


Thus,  each  clause  in  the  deduction  is  represented  by  a  unique  graph 

G.  whose  resultant  (or  resolvent  in  this  case)  is  B.  (or  a  variant).  A 
1  1 


graph  that  can  be  generated  in  this  way  by  some  deduction  is  called  a 


deducible  graph. 
Dominance 


Definition:  A  resolution  graph  dominates  another  graph  G,, 

X  -  -  -  -•  ^ 

(written  G  &  G  )  provided  the  resolvent  of  G  subsumes  the  resolvent 
X  A  X 

Of  G^. 

Using  the  graph  structures  and  associated  partitions  we  can  fre¬ 
quently  determine  by  inspection  whether  a  given  graph  dominates  another 
graph.  A  sufficient  criteria  for  G  to  dominate  G  is  that: 

J.  ^ 

(1)  The  literals  naming  free  circles  of  G^  are  a  subset  of  the 
literals  naming  free  circles  of  G  . 

(2)  P  £  P  (alternatively  a  =  a  o  A.  for  some  A), 

°1  ^2  S  °1 

In  this  case  C  A  E  C  where  C  and  C  are  the  resolvents  (or  resultants) 

X  ^  X  ^ 

of  G  and  G  respectively.  One  example  is  that  the  graph  of  a  factored 
X  ^ 

clause  is  dominated  by  the  graph  of  its  parent .  A  more  interesting 
example  is  given  as  follows: 

Let  G^  be  the  graph  of  Figure  5a,  so  6^  = j  i  I  L^^  L  jl. 

Let  be  a  variant  of  and  let  G  have  the  structure  shown  in 
Figure  5b .  Then 


e  * 

G 
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Observe  that  although  G  is  not  deducible,  its  resultant  is  identical 


to  (or  a  variant  of)  the  resultant  of  the  variant  was  added  to 


2 

11.  .  L 


(  "It 

L„1 .  IL 


.  xj  ,  .  ixj  ,  xj  (  ,  1  xj  .  L  .  'L  ,  L  land  therefore  £  ^  + 

1^  3]’  I  2^  2l^  I  1^  4i’  I  5’  6  G„  G 


show  that  G^  (Figure  5c)  dominates  the  graph  G  ^  because 

Consequently,  the  resultant  of  G  subsumes  the  resultant  of  G  .  For 

2  1 

example,  suppose  the  initial  literals  in  Figure  5  are: 


=  --  P(u) 

Lg  =  Q(z) 

S  =  P(a) 

=  P(x) 

Lg  =  R(x) 

L  =  ~  R(y) 

6 

^  S(y) 

A*  A 

=  ~  P(u) 

A  A 

-  Q(z)  . 

Then  the  reader  can  verify  that  the  resultant  of  G^  or  G  is  the  clause 
|q(z),  S(a)|  ,  while  the  resultant  of  G^  is  the  stronger  clause 

jQ(z),  S(x)| ,  This  illustrates  the  principle  (to  be  proven  in  Lemma  2) 

that  in  general,  if  one  resolves  first  and  then  factors,  rather  than 
vice  versa,  one  obtains  a  "stronger"  clause,  i.e.  a  clause  that  subsumes 
the  clause  obtainable  by  first  factoring  and  then  resolving. 
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We  are  interested  in  the  dominance  relation  for  the  following  reason; 

In  section  V  we  shall  deal  with  the  construction,  subject  to  certain  con¬ 
straints,  of  a  proof  represented  by  a  certain  graph  G.  Analogously  with 
the  above  example,  it  turns  out  to  be  easier  to  construct  a  different 
but  dominating  graph  G'  ^  G.  This  is  generally  satisfactory  since  if 
C-q/  ^  then  for  any  clause  that  can  be  deduced  (by  simple  resolution 
or  factoring)  from  the  resultant  of  G,  there  exists  a  clause  at  least  as 
strong  that  can  be  deduced  from  the  resultant  of  G^ — as  we  shall  show  below. 

Definition:  A  graph  G  is  said  to  contain  a  loop  if  two  initial 
graphs  of  G  are  connected  by  chains  of  unifier-connecter  literals  in 
more  than  one  way , 

Thus,  in  Figure  4b,  the  subgraphs  6,  8,  9,  10,  11  and 
12  contain  loops  while  the  remainder  are  loop-free.  We  note  that  the 
only  way  loops  can  occur  in  deducible  graphs  is  as  a  result  of  merges. 

This  fact  will  be  important  in  our  later  development  of  a  proof  strategy. 

The  following  theorem  establishes  the  equivalences  of  certain 
graphs,  and  corresponding  deductions. 

Theorem  1:  (Resolution  graph  theorem).  Let  G  be  a  graph  represent¬ 
ing  a  resultant  g-clause  C  constructed  from  a  set  S  of  initial  clauses 
(including  possible  repetitions),  and  let  be  the  class  of  unifier- 
connected  sets  of  literals  from  G. 

1.  exists,  and  the  resultant  C  is  obtained  by  applying  to 

G  G 

the  literals  of  the  free  nodes  of  G.  If  G  is  deducible,  any  deduction 
that  generates  G  produces  the  same  resultant  clause  . 

2.  If  G  is  constructed  from  G  and  G  by  a  double  bar  operation, 

1 

/ 

and  if  G  dominates  G  ,  then  either  G'  dominates  G,  or  else  G'  and  G 
11  1  1  ^ 

can  be  double  bar  connected  producing  a  graph  G'  dominating  G. 
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1 


3.  Let  be  a  subgraph  of  G  having  as  its  resultant,  and  let 

D  be  any  g-clause  subsuming  Let  be  the  initial  clauses  used 

in  G^ .  Then  there  is  a  graph  G'  containing  initial  clauses  from 
(S  -  U  that  dominates  G. 

4.  There  is  a  deducible  graph  G'  s  G. 

Proof:  (1)  This  statement  has  already  been  proved  (Lemma  1  and 

subsequent  definitions) . 

(2)  Let  and  be  the  (free)  nodes  of  G^  and  G^  respec¬ 
tively  that  are  double-bar  connected  in  G;  let  L^^  and  L^  be  their  respective 
associated  literals.  Since  the  resultant  of  G^  subsidies  the  resultant 

of  G^,  we  have  X  C  for  some  Moreover,  every  literal  in 

is  free  in  G  except  L^^,  so  the  only  way  could  fail  to  subsume  C  is  if 

L^  is  a  member  of  C'  X  (i.e.,  L^  =  L'  X  for  some  literal  L'  in  Cp .  In 

that  case  G^  can  be  doable  bar  connected  to  G  on  the  nodes  corresponding 

to  l'  and  L  ,  and  the  graph  obtained  clearly  dominates  G.  (Should  G'  have 
1  ^  1 

more  than  one  free  node  having  L^  as  an  associated  literal  we  must  first 
dot-connect  those  nodes  in 

(3)  To  simplify  the  discussion,  we  first  replace  the  sub- 

A 

graph  G^  in  G  by  an  initial  graph  for  its  resultant  C^.  We  let  G  be 

the  graph  so  obtained;  clearly  g  has  the  same  resultant  C,  and  6;:and  is 
constructed  from  initial  clauses  in  (S  -  S^)  IJ  Our  proof  is  by 

/V 

induction  of  the  total  number  of  links  in  G  (dotted  links  plus  double- 

/\  A 

bar  links).  Since  G  is  a  graph,  G  is  constructed  in  one  of  four  ways; 

A 

(a)  G  is  an  initial  clause  which  is  necessarily  C^.  In 
this  case  the  number  of  links  is  0  and  we  take  G'  =  the 
graph  of  D. 
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(b)  G  is  formed  from  a  graph  G  by  adding  a  dotted  link. 

rfN  . . . 

By  induction,  since  has  fewer  links  thans  G,  there  V  i. 

is  a  graph  G'  ^  G  containing  initial  clauses  from  (S  -  S^)  IJ 
2  ^ 

{d} .  Since  G  ^  G  (G  is  a  factor  of  G  )  we  have  G'  ^  G. 

(c)  G  is  formed  from  two  subgraphs  G  and  G  by  adding  a 

a  ^ 

dotted  link.  (Note  that  G  and  G  each  dominate  G.)  If  G 

2  o  2 

is  the  subgraph  containing  C  (exactly  one  of  G  or  G  con- 

i  2  3 

tains  C^),  we  take  G^  to  be  the  graph  satisfying  the  theorem 
for  G^.  However,  G'  2:  G^  ^  G  so  G'  satisfied  the  theorem  for  G. 

(d)  G  is  formed  from  G  and  G  by  adding  a  double  bar  link. 

^  <5 

Again,  assume  is  contained  in  G^,  and  let  G^  be  the  graph 
satisfying  the  theorem  for  G  .  By  part  (2)  of  Theorem  1, 

A 

either  G^  s  G  in  which  case  we  set  G'  =  or  else  G'  and  G 

A 

can  be  double  bar  connected  to  produce  a  graph  G'  s  G.  Since 
G^  is  constructed  from  S  -  S^i  and  G^  from  (S  -  S^)  U  [d] 
we  see  that  G^  satisfies  the  theorem  in  either  case. 

(4)  (We  again  do  a  proof  by  induction  on  the  total  number  of 


links  in  G) , 


(a)  If  G  is  an  initial  clause  it  is  deducible, 

(b)  If  G  is  formed  from  G^  by  adding  a  dotted  link,  let 

G'  be  the  deducible  graph  for  G^.  But  G'  ^  2:  G  so  G' 

is  a  deducible  graph  for  G. 

(c)  If  G  is  formed  from  G^  and  G^  by  adding  a  dotted  link, 
let  G'  be  the  deducible  graph  for  G^ .  Again,  G'  s  G^  ^  G 
so  G '  is  a  deducible  graph  for  G, 
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(d)  If  G  is  formed  from  G  and  G  by  adding  a  double  bar, 

then  again  by  induction  let  G'  and  G'  be  the  deducible  graphs 

X  2 

for  G  and  G  respectively.  Now  either  G'  a  G  in  which 

12  1 

.  ca:se  we  set  g'  =  g'  ;  or  g'  ^^  G  in  which  base  we  set  G  '  =,50 ;  or 

X  ^  ^ 

G'  and  G'  can  be  double  bar  connected  to  generate  a  graph 
X  2 

G'  ^  G^ .  In  this  last  case,  we  must  dot-connect  any  nodes 
of  G'  containing  identical  associated  literals  in  order  that 
this  last  step  constitutes  a  valid  deduction. 

Observations 

1 .  Sometimes  a  deducible  graph  may  be  generated  only  by  performing 
the  resolutions  in  one  order;  attempting  a  different  order  causes  literals 
to  collapse  (by  the  induced  partition)  and  thus  disappear  from  the  graph. 

For  example,  consider  the  deductions  possible  from  the  three  clauses 
C  ,  and  Cm  where, 

2  O 


=(~  P(x,  a),  R(x)} 

C  ={P(b,  z)  ^  Q(z,  b)} 

2 

Cg  =£P(u^  a)  ,  ~  Q(a,  u)} 

If  we  start  by  resolving  with  we  can  get  (by  Figure  €a)  {p(b,  a) , 
R(b)}.  If  we  instead  start  with  and  C„,  the  deducible  graph  has  an 
induced  dotted  link  and  we  get  simply  P(b,  a)  (Figure  6b),  or  if  we  wish^ 
R(b)  (Figure  6c)  . 

This  phenomenon  does  not  contradict  the  theorem,  which  merely 
asserts  that  the  resultant  clauses  are  the  same  whenever  the  graphs 
are  the  same,  which  they  are  not  in  Figure  6.  Note  that  if  we  use  the 
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pure  graph  operation  of  double-bar  link,  rather  than  resolution,  we  can 
construct  Figure  6a  in  any  order.  Moreover,  whenever  we  get  collapses  in 
deduced  graphs  we  end  up  with  a  stronger  clause,  i.e.  one  that  subsumes 
the  clause  obtained  by  resolving  in  a  sequence  that  avoids  the  collapse. 
Problems  such  as  this  order-dependence  of  deductions  would  have  been 
avoided  if  we  had  defined  a  clause  by  a  graph  and  an  inference  step  by 
a  double-bar  link;  such  an  approach  would  have  resulted  in  a  somewhat 
more  elegant  presentation.  However,  the  idea  that  a  clause  is  a  set  and 
resolution  is  an  inferential  operation  upon  sets  is  well  entrenched  in 
the  literature  (and  in  various  computer  implementations).  Therefore, 
this  paper  has  taken  the  more  complicated  approach  of  explicitly  dis¬ 
tinguishing  deduced  graphs  and  collapsed  literals, 

2 ,  Any  tautological  subgraph  may  be  eliminated  from  a  non- 
tautological  graph  without  weakening  a  deduction,  i.e.  there  exists  a 
graph  without  the  tautological  subgraph  whose  resultant  subsumes  the 
resultant  of  the  original  graph.  (A  tautological  graph  is  one  whose 

resultant  contains  a  pair  of  complementary  literals.)  Since  the  entire 
graph  is  nontautological,  there  must  exist  a  subgraph  double-bar  . 

linked  to  the  tautological  subgraph  U  on  one  of  the  troublesome  literals; 
but  then  G^^  itself  dominates!  the /graph  consisting  of  C  , 
and  G^, and  by  the  theorem  may  replace  it  in  any  larger  graph.  For 
example,  consider  the  propositional  deduction  of  Figure  7.  This  graph 
may  be  deduced  with  no  problems  by  doing  the  lower  resolution  first.  If 
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the  upper  resolution  is  done  first,  the  tautological  subgraph  J  with 
resultant  [p,  ~P,s}  is  deduced.  However,  note  that  the  clause  D  =  {p,  r} 
subsumes  the  resultant  of  the  entire  graph  {p,r, s}. 

V  STRATEGIES  AND  REORDERING  THEOREMS 

In  this  section  we  prove  the  completeness  of  a  new  strategy  for  con¬ 
structing  resolution  proofs.  (Remember  that  we  use  "resolution"  to  mean 
optional  _^factoring  followed  by  simple  resolution.)  This  strategy  severely 
limits  the  alternative  next  steps  available  at  each  stage  in  a  proof  by 
superimposing  several  of  the  constraints  described  by  other  workers.  Our 
approach  is  to  show,  constructively,  how  to  transform  any  given  graph 
into  a  dominating  graph  that  can  be  generated  by  a  proof  satisfying  the 
constraints . 

The  distinction  between  resolution  with  an  initial  clause,  and 
resolution  with  a  clause  generated  by  previous  resolution  steps,  is 
an  important  aspect  of  the  class  of  theorem-proving  strategies  with 
which  we  are  concerned .  Consider  a  deducible  graph  containing  several 
dotted  lines.  By  Theorem  1,  the  order  of  operation  that  generated  the 
graph  is  unimportant,  and  therefore  any  dotted  lines  between  literals 
in  the  same  initial  clause  can  be  produced  by  the  factoring  part  of  a 
resolution  operation  at  the  time  that  clause  is  introduced  into  the 
proof.  On  the  other  hand,  a  dotted  line  between  literals  in  different 
initial  clauses  (a  d~merge  node)  cannot  be  generated  by  a  deduction 
until  a  subgraph  has  been  constructed  that  contains  all  the  parent 
clauses  (because  step  3  in  the  definition  of  a  graph  can  never  be  used 
when  the  graph  is  generated  by  a  deduction) .  This  is  the  source  of 
most  of  the  complication  in  the  following  presentation,  and  the  reason 
why  d-merges  are  important  in  proof  strategies . 
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Dot  Reduction 


The  following  lemma  states  that  any  particular  dot -connected  group 
in  a  graph  may  be  split  . 

Lemma  2:  Let  G  be  the  graph  formed  by  a  double-bar  connection  between  a 

literal  L  in  graph  G  and  the  dot-connected  nocfe  £  =  }l,  ,  L  I  of 

1  I  1"  n( 

group  of  circles  in  graph  Then  the  graph  G^,  obtained  from  G  by  removing  L^ 


from  Z,  double-bar  connecting  L  in  G^  to  L^,  and  double-bar  connecting 

A 

the  remainder  of  £  to  the  copy  of  L  in  G^  (a  new  variant  copy  of 

exists  and  dominates  G  (see  Figure  8a^  c) . 

Proof ;  If  we  factor  any  clause  together  with  an  alphabetic  variant 

of  itself^  we  obtain  essentially  the  same  clause  again.  Therefore  graph 
* 


G  ^  obtained  by  replacing  subgraph  G^  in  G  by  such  a  factored  pair  of 

variants^  is  equivalent  to  G  (Figure  8b) .  Ignoring  signs  as  usual^  the 

relevant  subset  of  G  +  affected  by  the  modification  that  is  required 

G 


by  the  lemma  is 


{{L^,  L^^  '''j  JJ’  ■' The=- 


Gorresponding 


-1  e 


■} 


il 


Since  the  remainders  of 
* 


subset  of  is  j  jL^,  L^,  -q 

and  S,  f  are  identical,  ^  Gr<*j  therefore  2:  G  ,  and  thus,  G^  S  G 

(j  ^  Q  h  ^  ^ 

Kote :  Since  the  ancestors  of  a  d-raerge  literal  in  initial  clauses  form 

a  dot-connected  group  in  a  graph,  this  lemma  will  be  useful  for  eliminating 

or  reducing  the  complexity  of  d-merges  in  deducible  graphs. 

Fishtail  Deductions 

The  proof  method  of  our  main  theorem^  Theorem  2  below^  will  be 
induction  upon  the  number  of  merges  in  a  graph.  t 

Our  next  lemma  establishes  a  strong  condition  upon  the  structure  of  a 
proof  that  generated  a  graph  containing  no  merges. 
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Definition :  A  fishtail  construction  of  a  graph  G  from  a  set  S  of 


initial  clauses  (including  possible  repetitions)  is  a  finite  sequence 
of  graphs  G  ,  G  ,  ...  G  such  that 

X  A  m 

(1)  G^  is  the  graph  of  a  clause  from  S,  called  the  starting  clause 
of  the  construction. 

(2)  G.,  1  <  i  ^  m,  is  the  graph  obtained  from  G^  ^  and  (the  graph 
of)  a  member  C  of  S  by  first  dot-connecting  a  group  of  nodes  in  G^ 

then  dot-connecting  a  group  of  nodes  in  C  and  finally  double-bar  connect¬ 
ing  the  resulting  nodes.  (The  dot-connecting  could  be  a  vacuous  operation 
in  either  case.) 

(3)  G  is  G. 

m 

Definition:  A  fishtail  deduction  of  a  clause  C  from  a  set  S  of 
initial  clauses  is  a  finite  sequence  of  clauses  B  .  B  ...^  B  such  that 

(1)  B  e  S  and  is  called  the  starting  clause  of  the  deduction 

(2)  B,l<i^n,  is  the  result  of  resolving  B  with  some 

i  i  -1 

member  of  S.  (Remember  that  "resolving"  means  simple  resolution  after 

optional  factorings  of  both  parents.) 

(3)  B  is  C. 

n 

Lemma  3: 

(1)  If  there  exists  a  fishtail  construction  of  a  graph  G  with 
clauses  from  S  and  starting  clause  B,  whose  resultant  g-clause  is  C, 
then  there  exists  a  fishtail  deduction  of  a  clause  C '  fhat  generates  a 
graph  G'  ^  G  with  clauses  from  S  and  starting  clause  B. 
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(2)  If  G  is  a  graph  containing  no  merges  (i,e.,  dotted  links 
between  literals  in  different  initial  clauses),  then  there  exists  a 
fishtail  deduction  that  generates  a  graph  G'  s  G  with  any  clause  in 
G  used  as  starting  clause. 

Proof : 

(1)  Starting  with  B,  follow  the  steps  of  the  fishtail  construction. 
For  each  addition  of  a  dotted  link  make  a  factoring  step,  and  lor  each 
double-bar  to  a  new  clause  make  a  simple  resolution.  By  the  argument  in 
the  proof  of  theorem  1  part  2,  this  is  always  possible  unless  the  graph 
deduced  at  the  previous  deduction  step  already  dominates  the  graph  con¬ 
structed  at  the  current  construction  step — in  which  case  no  correspond¬ 
ing  deduction  step  is  necessary.  This  deduction  sequence  produces  dedu- 
cible  graphs  dominating  the  corresponding  constructed  graphs,  so  that  at 
the  final  step  the  deduced  graph  G^  will  dominate  the  constructed  graph  G. 

(2)  Since  there  are  no  merges,  each  resolution  link  connects 
exactly  two  initial  clauses  and  those  two  clauses  are  not  connected  in 
any  other  way,  i  .e  .  the  graph  contains  no  loops.  Therefore,  by  theorem 

1  part  T, ,  a  fishtail  construction  of  G  may  be  generated  from  any  starting 
clause,  and  by  (1)  there  exists  a  corresponding  fishtail  deduction  of 
G'  ^  G. 

Note  that  the  fishtail  construction  is  a  procedure  for  "growing” 
a  graph  G  by  starting  with  a  single  clause  and  "adding"  additional 
initial  clauses,  thus  forming  successively  larger  subgraphs  of  G.  Fur¬ 
ther  growth  of  a  subgraph  in  this  way  is  impossible  only  when  each  con¬ 
nection  from  the  subgraph  to  other  parts  of  G  is  a  double-bar  link  to  a 
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merge — because  each  merge  links  at  least  two  initial  clauses,  and  a 
fishtail  procedure  requires  adding  initial  clauses  one  at  a  time. 

Examples:  There  clearly  exists  a  fishtail  construction  of  the  merge¬ 
less  graph  of  Figure  9a  from  any  starting  clause.  In  Figure  9b,  any 

fishtail  construction  must  start  with  either  C  or  C  ,  since  otherwise 

1  2’ 

the  dotted  link  cannot  be  introduced.  Figure  9c  is  an  example  of  a 
graph  for  which  there  does  not  exist  a  fishtail  deduction.  It  represents 
a  refutation,  for  example,  of  the  four  clauses  p  V  q,  p  V  q,  ~  p  V  q, 

and  ~  p  V~  q.  Either  the  left  or  right  subgraph  can  be  generated  in 
fishtail  fashion,  but  the  growth  process  is  then  blocked  by  the  double¬ 
bar  link  to  a  merge. 

Main  Theorem 

Theorem  2;  Let  G  be  a  graph  generated  by  any  deduction  D  of  a 
clause  C  from  an  initial  set  S.  Then  there  exists  a  graph  g'  and  a 
deduction  d'  generating  G'  with  the  following  properties; 

(1)  g'  ^  G. 

(2)  The  deduction  contains  a  linear  sequence  of  clauses  B^, 

B  ,  . . . ,  B  such  that 
2^  n 

(a)  ®  arbitrarily  chosen  to  be  any  element 

of  S  whose  graph  occurs  in  G. 

(b)  For  l<i^n,  B  is  a  resolvent,  one  of  whose  parents 

'  i  ^ 

(the  "immediate"  parent)  is  B  .  (The  other  parent  of 

i-1 

B  is  called  the  "far"  parent.) 
i 
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(3)  Either  the  far  parent  of  B  ^  1  <  i  S  is  in  S  (the  fishtail 
property)^  or  the  far  parent  satisfies  all  the  following  conditions: 

(a)  It  is  B  for  some  j,  1  <  j  <  i. 

j 

(b)  It  is  a  d-merge,  and  the  merge  literal  is  the  literal 
resolved  upon. 

(c)  An  instance  of  the  resolvent  B  is  identical  to  an 

i 

instance  of  the  clause  obtained  by  deleting  the  literal 
resolved  upon  frran  the  immediate  parent  B 

i-1 

Proof :  The  strategy  for  the  proof  will  be  roughly  this:  We  shall 
select  successively  larger  subgraphs  of  G.  For  each  G^,  we  shall  show 
how  to  construct  a  deduction  d'  generating  a  graph  G'  ^  G^  such  that  d' 
has  properties  (2)  and  (3)  (assuming  G  is  replaced  by  G^  in  the  statement 
of  the  theorem).  Moreover,  will  be  derived  from  simply  by  extend¬ 

ing  the  deduction,  i.e.  by  using  the  resultant  c'  of  G'  as  a  starting  clause 

i  i 

and  "adding"  (successively  resolving  with)  initial  clauses  from  that 

did  not  occur  in  G. .  Eventually,  for  some  j,  G.  will  be  the  complete 

^  J 

graph  G,  at  which  time  D'  ^ill  be  the  required  D'. 

The  proof  is  by  induction  on  the  number  of  merges  in  G  .  By  Lemma  3 
the  theorem  is  true  for  any  graph  with  no  merges  (because  the  fishtail 
property  is  satisfied) .  Now  assume  it  is  true  for  any  graph  with  fewer 
than  n  merges,  and  assume  G  has  n  merges.  Choose  any  starting  clause 
Bj^  in  G.  Let  G^  be  the  largest  subgraph  of  G  for  which  there  exists  a 
fishtail  construction  with  starting  clause  B^.  is  the  resultant  of 
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If  =  G,  we  are  through,  because  by  Lemma  3  there  is  an  appropriate 

fishtail  deduction.  Otherwise,  choose  some  free  literal  L  (from  a  free 

node)  in  G^  that  is  double-bar  linked  to  the  dot-connected  node  £  = 

{L^,...,L^],  n  ^  2,  of  literals  in  G^,  a  subgraph  of  G  disjoint  from 

Gj^.  Each  corresponds  to  a  node  from  a  different  initial  clause.  (L 

must  exist  because  the  fishtail  construction  of  G^  cannot  be  continued.) 

* 

Let  G  be  the  subgraph  of  G  consisting  of  G  and  G  (Figure  10a) .  We 
^  J.  X 

shall  show  how  to  construct  a  deduction  D'  generating  a  graph  Gj  such 

5  5 

that  d'  and  G^  satisfy  the  theorem  (if  G^  is  the  entire  initial  graph  G) . 

5  5  5 

Reduce  the  merge  £  as  described  in  Lemma  2,  to  produce  G_  s  G 

D  D 

(Figure  10b) .  The  required  deduction  n'  generates  a  graph  dominating 
G  and  begins  as  follows;  Construct  the  fishtail  deduction  starting 

D 

from  of  a  graph  dominating  G^ .  Resolve  the  resultant  with  the 

* 

initial  clause  in  G  containing  L  (a  fishtail  step),  forming  G  ,  whose 

X  J.  M 

resultant  clause  is  C  .  Now  consider  G  ,  the  subgraph  of  G  shown  in 

^  o  o 

*  ♦ 

Figure  10c,  It  consists  of  G  and  a  subgraph  of  G  (namely,  G  without 

^  -LX 

the  merge  £  or  the  initial  clause  containing  L^) .  Since  the  former  sub¬ 
graph  G  is  deducible  (by  the  deduction  of  C  already  described),  we  may 
think  of  it  as  if  it  were  simply  a  single  initial  clause  C  . 

A 

Since  G  now  contains  C  and  a  subgraph  of  G  from  which  a  merge  £ 

O  " 

has  been  deleted,  G  certainly  has  less  than  n  merges.  Therefore,  by 

O 

the  inductive  hypothesis  there  exists  a  deduction  d'  with  starting  clause 

3 

C  ,  generating  a  graph  G'  ^  G  ,  that  satisfies  all  the  conditions  (2) 

^  o  o 

and  (3)  of  the  theorem. 
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We  now  ask,  does  dominate  G^?  If  so,  the  deduction  thus  far — 

the  deduction  of  C  followed  by  d' — is  the  required  Unfortunately, 

2  3  5 

g'  does  not  necessarily  dominate  G_ ,  ^  G  S  G~  ,  as  required;  and 

the  literals  associated  with  nodes  of  G  are  a  subset  of  those  of  G  , 

3  5 

except  that  G'  may  contain  free  literals  of  X.  Since  g"  has  no  free 
-  3  5  — 

literals  of  X,  the  resultant  of  does  not  necessarily  subsume  the 

resultant  of  G  .  We  first  factor  together  these  troublesome  literals, 

5 

forming  a  single  node.  The  rest  of  the  proof  is  concerned  with  "getting 
rid  of"  this  node. 

One  way  to  complete  the  deduction  is  to  resolve  C^,  the  resultant 
of  Gg,  with  a  variant  of  the  clause  that  appears  earlier  in  the  deduc¬ 
tion,  using  the  literals  of  X  in  against  the  variant  L  of  L  in 

(Figure  10b) .  However,  this  step  would  not  generally  satisfy  condition 
(3b)  of  the  theorem.  Therefore  we  must  be  somewhat  less  direct. 

Consider  61.  We  may  replace  its  subgraph  G  by  the  single  "initial" 

O 

J 

clause  C  .  (Note  that  the  remainder  of  G_  consists  simply  of  G, .)  Let 
3  5  1 

G^  be  the  largest  subgraph  remaining  in  for  which  there  exists  a  fish¬ 
tail  construction  with  as  starting  clause.  Let  be  the  fishtail 
deduction  of  a  graph  dominating  G^(Lemma  3)  (Figure  lOd) . 

Finally,  consider  the  case  in  which  could  not  generate  all  of 
the  subgraph  G^.  The  fishtail  process  cannot  continue  only  if  the 
resultant  of  the  deduction  thus  far  must  be  resolved  with  a  merged  set 


of  literals  in  the  remaining  part  of  the  graph.  Recall  that  actually 
includes  deductions  that  start  from  and  generate  graphs  dominating  G^, 
then  G  ,  G  ,  and  finally  G  .  Let  us  number  the  steps  of  D'.  Let  C'  be 

o  4  ft  ft 

the  name  of  the  final  clause  deduced  by  and  assume  is  deduced  at 
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step  k  -  1.  At  step  k,  we  would  like  to  resolve  against  the  resultant 
of  a  subgraph  of  containing  a  particular  merge  (merge  ^  in  Figure  10b) 
a  nonfishtail  step.  However,  since  was  generated  by  a  fishtail  deduc¬ 
tion,  that  merge  must  have  been  formed  in  an  immediate  parent  during  that 
deduction;  and  that  deduction  is  the  first  part  of  D^.  Therefore  a 
suitable  clause  for  completing  the  deduction  already  exists  as  one  of  the 

B.,  1  <  j  <  k  (property  (3a)).  (In  Figure  lOe,  we  have  drawn  a  copy  C 
J  k’ 

of  this  clause  C,  to  show  the  final  resolution.)  Note  that  the  resolu- 
k 

tion  must  be  performed  upon  a  merge  literal,  thereby  satisfying  property 
(3b)  . 

Let  be  the  substitution  required  for  this  resolution  at  step  k, 

and  be  a  substituion  that  merges  corresponding  literals  that  are  left 

in  the  two  copies  C,  and  C,  appearing  in  the  final  resolvent  R  =  c' . 

k  k  k  5 

Then  applied  to  would  make  it  identical  to  the  clause  obtained  by 

applying  o  to  the  iinmediate  parent  after  deleting  the 

literal  resolved  upon,  satisfying  property  (3c) . 

Finally,  suppose  G  is  not  the  complete  starting  graph,  and  instead 

o 

G  contains  additional  subgraphs.  Because  of  the  definition  of  G^^,  these 

additional  subgraphs  must  contain  merge  literals  double-bar  linked  to  G  , 

o 

We  may  replace  G  in  G  by  the  resultant  of  G',  and  continue  the  con- 

O  u  O 

struction  of  the  required  deduction  by  "adding"  to  D^,  i.e.  by  treating 

o 

as  the  starting  clause, 
o 

Tautologies 

For  purposes  of  efficiency  in  a  proof  strategy  based  on  Theorem  2, 
it  would  be  desirable  to  add  the  following  condition: 

(2d)  No  is  a  tautology. 
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Unf ortunately^  this  is  not  true  under  the  premises  that  G  is  generated 
by  any  deduction  from  any  initial  set  S  and  B  may  be  chosen  arbitrarily. 

I-  - 

For  example^  if  the  middle  clause  (p  V  q)  is  selected  as  in  the  pro- 
positional  graph  of  Figure  11,  then  a  tautological  intermediate  clause 
must  be  generated  before  obtaining  the  resultant  (~p  V~q  Vr  Vs). 

As  we  mentioned  in  the  discussion  after  Theorem  1,  the  tautological 
subgraphs  may  be  eliminated;  in  this  particular  example,  either  of  the 
two  terminal  clauses  subsumes  the  resultant .  The  problem  here  is  that 
the  designated  starting  clause,  p  V  q,  is  essentially  irrelevant  to  the 
desired  result.  If  we  are  interested  in  refutations ,  i.e.  deductions  whose 
resultant  C  is  the  empty  clause  (graphs  containing  no  free  nodes) ,  then 
following  Anderson  and  Bledsoe  (1970)  we  could  require  that  the  initial 
set  S  be  minimally  unsatisfiable .  This  means  that  S  is  unsatisf iable, 
but  for  any  clause  C  g  S,  S  -  |c|  is  satisfiable.  Therefore  the  graph 
of  any  refutation  of  S,  including  those  whose  tautological  subgraphs 
have  been  eliminated,  must  contain  an  occurrence  of  every  clause  in  S. 

Thus  we  can  assert  the  following  theorem: 

Theorem  2^ :  Let  G  be  a  resolution  graph  generated  by  a  refutation 
from  a  minimally  unsatisfiable  set  S.  Then  there  exists  a  refutation 
generating  G^  that  satisfies  conditions  (2)  and  (3)  of  Theorem  2,  and 

also  satisfies  the  conditions  that  no  B  is  a  tautology. 

1 

Proof ;  First  eliminate  all  tautological  subgraphs  from  G,  producing 

A 

a  refutation  graph  G.  Since  S  is  minimally  unsatisfiable,  any  starting 
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clause  chosen  from  S  must  occur  in  G.  Use  G  as  the  given  graph  in 
the  proof  of  Theorem  2,  Since  that  proof  always  replaces  subgraphs  by 
subsuming  clauses  and  a  tautology  does  not  subsume  any  nontautological 
clause^  no  tautologies  are  introduced  during  the  proof  process. 

VI  CONCLUSIONS 
Relation  to  Previous  Work 

Theorem  2  establishes  that  if  a  resolution  deduction  exists  at  all, 
then  one  exists  that  simultaneously  satisfies  several  conditions. 

Clearly  an  assertion  that  a  deduction  exists  satisfying  only  some  of 
these  conditions  would  be  a  corollary  of  Theorem  2.  Therefore,  we  have 
just  proven  the  completeness  of  all  the  following  proof  strategies : 

1.  Property  (2a)  establishes  that  any  single  initial  clause 
occuring  in  a  resolution  proof  is  a  sufficient  set-of -support  (Henschen, 
1968)  . 

2.  Properties  (2)  and  (3a)  constitute  the  "ancestry 

filter"  described  by  Luckham  (1969) . 

3.  Property  (3c)  is  essentially  the  subsumption  condition  described 
by  Loveland  (1970) — and  shown  by  Loveland  to  be  compatible  with  ancestry 
filter  and  set-of-support . 

4.  Property  (3b)  is  essentially  a  statement  of  Andrews'  (1968)  merge 
condition  (shown  by  Andrews  to  be  compatible  with  set-of-support). 

Therefore  the  main  results  of  this  paper  were  to  establish  that  all 
these  strategies  could  be  used  simultaneously  without  losing  completeness. 
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and  to  give  some  insight  (by  means  of  the  resolution  graph  notation) 
into  the  significance  of  using  each  of  the  strategies. 

Proof  Strategy 

Consider  the  following  strategy  §  for  proving  a  theorem  from  a  set 
of  axioms  G  by  resolution; 

1.  Let  S  be  the  set  of  clauses  obtained  by  placing  all  the 
members  of  G^  and  the  negation  of  the  theorem,  in  quantifier-free 
conjunctive  form. 

2.  Choose  any  clause  in  S  that  is  known  to  be  needed  in  the  proof,  usu¬ 
ally  a  clause  from  the  negation  of  the  theorem,  as  the  first  clause  B^. 

3.  For  each  sequence  B, ,  B. ,  consider  as  successor 

X  ^  1 

clauses  every  B  that  satisfies  all  the  properties  (1)  and  (2)  of 
i+1 

Theorem  2  (and,  if  S  is  minimally  unsatisfiable,  the  additional  condition 
of  Theorem  2^),  This  defines  a  tree  of  deductions. 

4.  Choose  any  algorithm  for  exhaustively  searching  the  tree,  e.g., 
breadth -first,  or  unit  preference  with  level  bound  (Wos  ©t  al.,  1964) . 

Apply  the  algorithm. 

5.  A  deduction  of  □  ( a  "refutation")  constitutes  a  proof  of  the 
theorem. 

If  any  refutation  containing  the  clause  B^  exists,  one  will  be  found 
by  §..  Moreover,  the  nodes  in  the  deduction  tree  have  fewer  successors  than 
those  of  trees  corresponding  to  less  restrictive  strategies;  one  hopes  that 
this  reduction  in  successors  may  reduce  the  total  effort  needed  to  find  a 
proof.  However,  refutations  that  satisfy  all  the  properties  (1)  and  (2) 
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are  usually  longer  than  refutations  that  do  not,  and  the  relative  sizes 
of  the  trees  to  be  generated  remain  an  open  question. 

The  construction  of  a  proof  may  be  viewed  as  a  tree -searching  problem. 
Property  (1),  the  linear  format,  essentially  defines  a  class  of  deduction 
trees  to  be  searched.  The  conditions  of  property  (2)  define  the  successor 
function.  Theorem  2,  and  thus  step  4  of  §,  have  not  treated  the  problem 
of  selecting  a  good  algorithm  for  attempting  to  find  a  refutation  in  the 
tree — and  yet  this  algoritlun  may  have  a  profound  effect  on  the  effectiveness 
of  the  search.  Perhaps  semantic  heuristics,  such  as  some  kind  of  model 
partition  strategy  (Luckham,  1968)  can  be  embodied  into  this  algorithm 
without  losing  completeness.  Another  possibility  is  that  suitable 
bounds  can  be  found  to  enable  practical  use  of  the  optimum  tree -searching 
strategy  A  (Hart  et  al.,  1968).  Kowalski’s  paper  (Kowalski, 

1970)  discusses  this  problem. 

Further  improvements  in  theorem-proving  strategies  might  be  obtained 
by  studying  the  topological  properties  of  resolution  graphs.  For  example, 
the  construction  in  the  proof  of  Theorem  2  involves  transforming  a 
portion  of  a  graph  into  one  that  is  in  a  sense  topologically  simpler. 

In  the  extreme  case  of  a  graph  with  no  loops,  the  stronger  result  of 
Lemma  3  is  possible. 

Theorem  2,  property  (2a)  states  that  any  clause  that  is  used  in  a 
proof  may  be  used  as  the  top  clause  i.e.,  is  a  sufficient  set  of 
support.  However,  the  choice  of  this  clause  may  have  a  drastic  effect 
on  the  length  of  the  shortest  deduction  satisfying  the  rest  of  the 
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properties.  For  example,  suppose  g  cousists  of  the  following  clauses: 


and  the  negation  of  the  theorem  of  two  clauses: 


Nl.  L  P  ,  Q  I 
(  a"^  a) 

N2.  S  I 

I 


If  Nl  is  chosen  as  we  can  get  the  refutation  of  Figure  12a,  whose 
graph  is  Figure  12b,  On  the  other  hand,  if  N  =  ^  the  refutation  of 

A  X 

Figure  13a,  graphed  in  Figure  13b,  is  probably  the  shortest  one  that 
satisfies  the  conditions  of  Theorem  2.  In  general,  given  the  resolution 
graph  of  a  deduction,  one  may  be  able  to  establish  on  a  purely  topological 
basis  the  lengths  of  equivalent  deductions  that  use  particular  strategies 
or  support  sets. 

Another  potential  use  for  these  graphs  is  as  a  basis  to  some  new 
heuristic  procedure  for  guiding  the  construction  of  a  refutation.  The 
graph  of  a  refutation  contains  no  free  literals.  The  graph  of  an 
intermediate  stage  of  a  deduction  therefore  contains  free  literals  that 
must  be  eliminated  or  "resolved  away"  in  order  to  complete  the  proof. 

Since  the  graph  structure  contains  more  information  than  exists  in  its 
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resultant  clause^  perhaps  a  better  strategy  can  be  found  than  the  usual 
unit  preference  or  fewest  component  strategies. 

Finally,  let  us  consider  the  notion  of  "single  connectedness."  Wos 
et  al  (1967)  define  a  resolution  procedure  to  be  singly  connected  provided 
no  clash  is  generated  in  more  than  one  way,  where  clash  may  be  defined 

now  as  a  clause  whose  resolution  graph  consists  of  n  initial  clauses,  each 
resolved  with  a  different  literal  from  one  initial  clause  of  length  greater 
than  n.  (By  Theorem  1,  all  clashes  that  generated  the  same  graph  are  equiv¬ 
alent.)  A  somewhat  stronger  property  would  be  the  following: 

Definition;  A  resolution  procedure  is  strongly  singly  connected 

iff  it  never  produces  two  different  deductions  that  generate  the  same 
resolution  graph. 

Since  generation  of  the  same  (resultant)  clause  by  alternate, 
equivalent  deductions  is  a  major  cause  of  wasted  effort  in  resolution 
procedures,  strong  single  connectedness  is  an  extremely  desirable 
property.  Perhaps  the  concept  of  resolution  graphs  can  be  the  basis  for 
a  bookkeeping  procedure  for  achieving  this  property  in  general.  (We 
are  aware  of  existing  bookkeeping  procedures  that  only  work  in  unit 
resolutions.)  Unfortunately,  in  order  to  test  whether  a  proposed  deduction  is 
following  a  previously  attempted  path  in  this  way,  we  would  require  an 
algorithm  for  testing  whether  the  current  resolution  graph  is  a  subgraph 
of  any  of  a  set  of  other  previously  established  graphs.  We  know  of  no 
algorithm  for  this  at  present  that  is  sufficiently  efficient  to  be  practical. 
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a.  Possible  Graphs 


b.  Structures  That  Cannot  be  Graphs 
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FIGURE  4  EXAMPLES  OF  SUBGRAPHS 


FIGURE  5  GRAPH  DOMINANCE 


FIGURE  8  MERGE  REDUCTION 
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